Logical and Physical Data Independence for Native Scientific Data Repositories

نویسندگان

  • Bill Howe
  • David Maier
چکیده

Many datasets in the physical sciences, especially the results of simulations, are defined over a topological grid structure. Applications in these domains would benefit from a principled interface to gridded datasets via a specialized data model. Traditionally, benefits of a data model are realized only after data is ensconced within a managed database environment. However, massive bulk-loading and reloading operations in large-scale data repositories are prohibitively expensive. Instead, we superimpose a specialized data model over native data repositories stored on directly on OS filesystems rather than managed by a database system. Views in a specialized data model can be defined via references to native directory structures and file content, providing physical and logical data independence. This non-intrusive approach appears to reduce space requirements, speed development, and cooperate with legacy applications.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Designing a logical data model of athletes' hospital information management system based on international standards

Background and purpose: Today, hospital care, relying on the health record system, has received more attention than before. Considering the diversity of data in these systems, the design of standard conceptual and logical models by service providers will play an important role in their success. Therefore, this research was conducted with the aim of designing a logical data model of the hospital...

متن کامل

Arnold: Declarative Crowd-Machine Data Integration

The availability of rich data from sources such as the World Wide Web, social media, and sensor streams is giving rise to a range of applications that rely on a clean, consistent, and integrated database built over these sources. Human input, or crowd-sourcing, is an effective tool to help produce such high-quality data. It is infeasible, however, to involve humans at every step of the data cle...

متن کامل

Efficient Storage of XML Data

NATIX is an efficient, native repository for storing, retrieving and managing XML documents. Other systems map XML data into structures maintainable by traditional DBMS. This introduces additional layers between the logical data and its physical storage, slowing down both updates and query processing. NATIX is native in the sense that it supports tree-structured objects like XML documents at lo...

متن کامل

Towards a Universal Media Server

In this report, we give an overview of the SFB 501 approach on developing abstractions and concepts needed to achieve the long-term objective of realizing a genuine universal media server. Such a media server is supposed to supplement repositories of any kind (e. g., digital libraries) by providing a high-level interface that allows applications to access and process media data over a network (...

متن کامل

Retrofitting a Data Model to Existing Environmental Data

Environmental data repositories are frequently stored as a collection of packed binary files arranged in an intricate directory structure, rather than in a database. In previous work, we 1) show that environmental data is often logically equipped with a topological grid structure and 2) provide a data model and algebra of gridfields for manipulating such gridded datasets. In this paper, we show...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IEEE Data Eng. Bull.

دوره 27  شماره 

صفحات  -

تاریخ انتشار 2004